Ultrafast shape recognition for similarity search in molecular databases

Authors

  • Pedro J. Ballester
  • Graham Richards
  • Christian Lessig
Abstract

Molecular databases are routinely screened for compounds that most closely resemble a molecule of known biological activity to provide novel drug leads. It is widely believed that three-dimensional molecular shape is the most discriminating pattern for biological activity as it is directly related to the steep repulsive part of the interaction potential between the drug-like molecule and its macromolecular target. However, efficient comparison of molecular shape is currently a challenge. Here, we show that a new approach based on moments of distance distributions is able to recognize molecular shape at least three orders of magnitude faster than current methodologies. Such an ultrafast method permits the identification of similarly shaped compounds within the largest molecular databases. In addition, the problematic requirement of aligning molecules for comparison is circumvented, as the proposed distributions are independent of molecular orientation. Our methodology could also be adapted to tackle similar hard problems in other fields, such as designing content-based Internet search engines for three-dimensional geometrical objects or performing fast similarity comparisons between proteins. From a broader perspective, we anticipate that ultrafast pattern recognition will soon become not only useful, but also essential to address the data explosion currently experienced in most scientific disciplines.
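The idea of describing shape through moments of distance distributions can be illustrated with a minimal Python sketch. The abstract does not state which reference points or which moments are used, so the choices below are assumptions: four reference points (the centroid, the atom closest to it, the atom farthest from it, and the atom farthest from that one) and three moments per distribution (mean, standard deviation, cube root of the third central moment). The function names `distance_moments`, `shape_descriptor` and `similarity` are hypothetical, and the inverse mean-absolute-difference score is one plausible choice rather than a confirmed detail of the paper.

```python
import numpy as np


def distance_moments(coords, ref):
    """Three moments of the distribution of atom distances to one reference point."""
    d = np.linalg.norm(coords - ref, axis=1)
    mean = d.mean()
    std = d.std()
    # Assumption: cube root of the third central moment, so that all
    # three values share the same length unit.
    third = np.cbrt(((d - mean) ** 3).mean())
    return np.array([mean, std, third])


def shape_descriptor(coords):
    """12-number descriptor built from four assumed reference points."""
    coords = np.asarray(coords, dtype=float)
    centroid = coords.mean(axis=0)
    d_centroid = np.linalg.norm(coords - centroid, axis=1)
    closest = coords[d_centroid.argmin()]    # atom closest to the centroid
    farthest = coords[d_centroid.argmax()]   # atom farthest from the centroid
    d_farthest = np.linalg.norm(coords - farthest, axis=1)
    far_from_far = coords[d_farthest.argmax()]  # atom farthest from that atom
    refs = (centroid, closest, farthest, far_from_far)
    return np.concatenate([distance_moments(coords, r) for r in refs])


def similarity(a, b):
    """Score in (0, 1]; equals 1 when the two descriptors are identical."""
    return 1.0 / (1.0 + np.abs(a - b).mean())
```

Because the descriptor is built only from interatomic distances, rotating or translating the coordinates leaves it unchanged up to floating-point error, which is why no alignment step is needed. Comparing two molecules then reduces to comparing two short fixed-length vectors, which is consistent with the speed argument made in the abstract.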

Related resources

Ultrafast shape recognition to search compound databases for similar molecular shapes

Finding a set of molecules that closely resemble a given lead molecule in a database containing potentially billions of chemical structures is an important but daunting problem. Similar molecular shapes are particularly important, given that in biology small organic molecules frequently act by binding into a defined and complex site on a macromolecule. Here, we present a new method for mol...

3D Face Recognition using Patch Geodesic Derivative Pattern

In this paper, a novel Patch Geodesic Derivative Pattern (PGDP) describing the texture map of a face through its shape data is proposed. Geodesic adjusted textures are encoded into derivative patterns for similarity measurement between two 3D images with different pose and expression variations. An extensive experimental investigation is conducted using the publicly available Bosphorus and BU-3...

USRCAT: real-time ultrafast shape recognition with pharmacophoric constraints

BACKGROUND: Ligand-based virtual screening using molecular shape is an important tool for researchers who wish to find novel chemical scaffolds in compound libraries. The Ultrafast Shape Recognition (USR) algorithm is capable of screening millions of compounds and is therefore suitable for usage in a web service. The algorithm, however, is agnostic of atom types and cannot discrimina...

A Partial Shape Matching Method for 3d Model Databases

The use of 3D models is gaining popularity since they are important for computer graphics applications. Recently, similarity retrieval techniques for 3D models have been investigated intensively for handling databases of 3D models systematically. The techniques extract shape descriptors from 3D models and use these descriptors for indices for comparing shape similarities. Various shape descript...

USR-VS: a web server for large-scale prospective virtual screening using ultrafast shape recognition techniques

Ligand-based Virtual Screening (VS) methods aim at identifying molecules with a similar activity profile across phenotypic and macromolecular targets to that of a query molecule used as search template. VS using 3D similarity methods has the advantage of biasing this search toward active molecules with innovative chemical scaffolds, which are highly sought after in drug design to provide novel...

Publication date: 2007